Extension of TSVM to Multi-Class and Hierarchical Text Classification Problems With General Losses

Authors: Sellamanickam Sundararajan, Selvaraj Sathiya Keerthi, Shevade Shirish
Publication date: 01/11/2012

Transductive SVM (TSVM) is a well-known semi-supervised large margin learning method for binary text classification. In this paper we extend this method to multi-class and hierarchical classification problems. We point out that the determination of labels of unlabeled examples with fixed classifier weights is a linear programming problem. We devise an efficient technique for solving it. The method is applicable to general loss functions. We demonstrate the value of the new method using large margin loss on a number of multi-class and hierarchical classification datasets. For maxent loss, we show empirically that our method is better than expectation regularization/constraint and posterior regularization methods, and competitive with the version of the entropy regularization method that uses label constraints.

arXiv.org e-Print Archive

Open Access Repository of IISc Research Publications

Asymptotic behavior of the optimal poles of linear optimal regulators.

Author: Keerthi Selvaraj Sathiya
Publication venue: Scholars' Mine
Publication date: 01/01/1982

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Semi-supervised SVMs for classification with unknown class proportions and a small labeled dataset

Authors: Bhar Bigyan, Sellamanickam Sundararajan, Selvaraj Sathiya Keerthi, Shevade Shirish
Publication venue: Association for Computing Machinery
Publication date: 01/01/2011

In the design of practical web page classification systems, one often encounters a situation in which the labeled training set is created by choosing some examples from each class, but the class proportions in this set are not the same as those in the test distribution to which the classifier will actually be applied. The problem is made worse when the amount of training data is also small. In this paper we explore and adapt binary SVM methods that make use of unlabeled data from the test distribution, viz., Transductive SVMs (TSVMs) and expectation regularization/constraint (ER/EC) methods, to deal with this situation. We empirically show that when the labeled training data is small, TSVM designed using the class ratio tuned by minimizing the loss on the labeled set yields the best performance; its performance is good even when the deviation between the class ratios of the labeled training set and the test set is quite large. When the labeled training data is sufficiently large, an unsupervised Gaussian mixture model can be used to get a very good estimate of the class ratio in the test set; also, when this estimate is used, both TSVM and ER/EC give their best possible performance, with TSVM coming out superior. The ideas in the paper can be easily extended to multi-class SVMs and MaxEnt models.

Crossref

Open Access Repository of IISc Research Publications